IREP++, A Faster Rule Learning Algorithm
نویسندگان
چکیده
We present IREP++, a rule learning algorithm similar to RIPPER and IREP. Like these other algorithms IREP++ produces accurate, human readable rules from noisy data sets. However IREP++ is able to produce such rule sets more quickly and can often express the target concept with fewer rules and fewer literals per rule resulting in a concept description that is easier for humans to understand. The new algorithm is fast enough for interactive training with very large data sets.
منابع مشابه
Fast Eeective Rule Induction
Many existing rule learning systems are computationally expensive on large noisy datasets. In this paper we evaluate the recently-proposed rule learning algorithm IREP on a large and diverse collection of benchmark problems. We show that while IREP is extremely eecient, it frequently gives error rates higher than those of C4.5 and C4.5rules. We then propose a number of modiications resulting in...
متن کاملFast Eective Rule Induction
Many existing rule learning systems are computationally expensive on large noisy datasets In this paper we evaluate the recently proposed rule learning algorithm IREP on a large and diverse collection of benchmark problems We show that while IREP is extremely e cient it frequently gives error rates higher than those of C and C rules We then propose a num ber of modi cations resulting in an algo...
متن کاملPredictive Graph Mining
Graph mining approaches are extremely popular and effective in molecular databases. The vast majority of these approaches first derive interesting, i.e. frequent, patterns and then use these as features to build predictive models. Rather than building these models in a two step indirect way, the SMIREP system introduced in this paper, derives predictive rule models from molecular data directly....
متن کاملMMDT: Multi-Objective Memetic Rule Learning from Decision Tree
In this article, a Multi-Objective Memetic Algorithm (MA) for rule learning is proposed. Prediction accuracy and interpretation are two measures that conflict with each other. In this approach, we consider accuracy and interpretation of rules sets. Additionally, individual classifiers face other problems such as huge sizes, high dimensionality and imbalance classes’ distribution data sets. This...
متن کاملComparative Analysis of Machine Learning Algorithms with Optimization Purposes
The field of optimization and machine learning are increasingly interplayed and optimization in different problems leads to the use of machine learning approaches. Machine learning algorithms work in reasonable computational time for specific classes of problems and have important role in extracting knowledge from large amount of data. In this paper, a methodology has been employed to opt...
متن کامل